Self-Supervised Compression and Artifact Correction for Streaming Underwater Imaging Sonar

Qian, Rongsheng, Xu, Chi, Ma, Xiaoqiang, Fang, Hao, Jin, Yili, Atlas, William I., Liu, Jiangchuan

arXiv.org Artificial Intelligence

Real-time imaging sonar is crucial for underwater monitoring where optical sensing fails, but its use is limited by low uplink bandwidth and severe sonar-specific artifacts (speckle, motion blur, reverberation, acoustic shadows) affecting up to 98% of frames. We present SCOPE, a self-supervised framework that jointly performs compression and artifact correction without clean-noise pairs or synthetic assumptions. SCOPE combines (i) Adaptive Codebook Compression (ACC), which learns frequency-encoded latent representations tailored to imaging sonar, with (ii) Frequency-Aware Multiscale Segmentation (FAMS), which decomposes frames into low-frequency structure and sparse high-frequency dynamics while suppressing rapidly fluctuating artifacts. A hedging training strategy further guides frequency-aware learning using low-pass proxy pairs generated without labels. Evaluated on months of in-situ ARIS sonar data, SCOPE achieves a structural similarity index (SSIM) of 0.77, representing a 40% improvement over prior self-supervised denoising baselines, at bitrates down to 0.0118 bpp. It reduces uplink bandwidth by more than 80% while improving downstream detection. The system runs in real time, with 3.1 ms encoding on an embedded GPU and 97 ms full multi-layer decoding on the server end. SCOPE has been deployed for months in three Pacific Northwest rivers to support real-time salmon enumeration and environmental monitoring in the wild. Results demonstrate that learning frequency-structured latents enables practical, low-bitrate sonar streaming with preserved signal details under real-world deployment conditions.
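The low/high-frequency split at the heart of FAMS can be illustrated with a minimal sketch. The `split_frequencies` helper and its radial FFT mask are illustrative assumptions, not SCOPE's learned components:

```python
import numpy as np

def split_frequencies(frame, cutoff=0.1):
    """Split a 2D frame into low-frequency structure and a high-frequency
    residual using a radial low-pass mask in the FFT domain."""
    spec = np.fft.fftshift(np.fft.fft2(frame))
    h, w = frame.shape
    yy, xx = np.mgrid[-(h // 2):h - h // 2, -(w // 2):w - w // 2]
    radius = np.sqrt((yy / h) ** 2 + (xx / w) ** 2)  # normalized frequency
    low = np.real(np.fft.ifft2(np.fft.ifftshift(spec * (radius <= cutoff))))
    high = frame - low  # everything the low-pass mask removed
    return low, high
```

By construction the two parts reconstruct the frame exactly; a codec in this spirit could spend most bits on the smooth `low` component and transmit only sparse changes in `high`.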


A Good Plan is Hard to Find: Aligning Models with Preferences is Misaligned with What Helps Users

Balepur, Nishant, Shu, Matthew, Sung, Yoo Yeon, Goldfarb-Tarrant, Seraphina, Feng, Shi, Yang, Fumeng, Rudinger, Rachel, Boyd-Graber, Jordan Lee

arXiv.org Artificial Intelligence

To assist users in complex tasks, LLMs generate plans: step-by-step instructions towards a goal. While alignment methods aim to ensure LLM plans are helpful, they train (RLHF) or evaluate (ChatbotArena) on what users prefer, assuming this reflects what helps them. We test this with Planorama: an interface where 126 users answer 300 multi-step questions with LLM plans. We get 4388 plan executions and 5584 comparisons to measure plan helpfulness (QA success) and user preferences on plans, and recreate the setup in agents and reward models to see if they simulate or prefer what helps users. We expose: 1) user/model preferences and agent success do not accurately predict which plans help users, so common alignment feedback can misalign with helpfulness; 2) this gap is not due to user-specific preferences, as users are similarly successful when using plans they prefer/disprefer; 3) surface-level cues like brevity and question similarity strongly link to preferences, but such biases fail to predict helpfulness. In all, we argue aligning helpful LLMs needs feedback from real user interactions, not just preferences of what looks helpful, so we discuss the plan NLP researchers can execute to solve this problem.


Exploring Multimodal Foundation AI and Expert-in-the-Loop for Sustainable Management of Wild Salmon Fisheries in Indigenous Rivers

Xu, Chi, Jin, Yili, Ma, Sami, Qian, Rongsheng, Fang, Hao, Liu, Jiangchuan, Liu, Xue, Ngai, Edith C. H., Atlas, William I., Connors, Katrina M., Spoljaric, Mark A.

arXiv.org Artificial Intelligence

Wild salmon are essential to the ecological, economic, and cultural sustainability of the North Pacific Rim. Yet climate variability, habitat loss, and data limitations in remote ecosystems that lack basic infrastructure support pose significant challenges to effective fisheries management. This project explores the integration of multimodal foundation AI and expert-in-the-loop frameworks to enhance wild salmon monitoring and sustainable fisheries management in Indigenous rivers across the Pacific Northwest. By leveraging video and sonar-based monitoring, we develop AI-powered tools for automated species identification, counting, and length measurement, reducing manual effort, expediting delivery of results, and improving decision-making accuracy. Expert validation and active learning frameworks ensure ecological relevance while reducing annotation burdens. To address unique technical and societal challenges, we bring together a cross-domain, interdisciplinary team of university researchers, fisheries biologists, Indigenous stewardship practitioners, government agencies, and conservation organizations. Through these collaborations, our research fosters ethical AI co-development, open data sharing, and culturally informed fisheries management.


Advancing Large Language Models for Spatiotemporal and Semantic Association Mining of Similar Environmental Events

Tian, Yuanyuan, Li, Wenwen, Hu, Lei, Chen, Xiao, Brook, Michael, Brubaker, Michael, Zhang, Fan, Liljedahl, Anna K.

arXiv.org Artificial Intelligence

Retrieval and recommendation are two essential tasks in modern search tools. This paper introduces a novel retrieval-reranking framework leveraging Large Language Models (LLMs) to enhance the spatiotemporal and semantic association mining and recommendation of relevant unusual climate and environmental events described in news articles and web posts. This framework uses advanced natural language processing techniques to address the limitations of traditional manual curation methods in terms of high labor cost and lack of scalability. Specifically, we explore an optimized solution to employ cutting-edge embedding models for semantically analyzing spatiotemporal events (news) and propose a Geo-Time Re-ranking (GT-R) strategy that integrates multi-faceted criteria including spatial proximity, temporal association, semantic similarity, and category-instructed similarity to rank and identify similar spatiotemporal events. We apply the proposed framework to a dataset of four thousand Local Environmental Observer (LEO) Network events, achieving top performance in recommending similar events among multiple cutting-edge dense retrieval models. The search and recommendation pipeline can be applied to a wide range of similar data search tasks dealing with geospatial and temporal data. We hope that by linking relevant events, we can better aid the general public in gaining an enhanced understanding of climate change and its impact on different communities.
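A re-ranking strategy of this kind can be sketched as a weighted combination of per-criterion scores. The weights, decay scales, and exponential-decay form below are illustrative assumptions, not the paper's GT-R formula:

```python
import math

def gt_r_score(spatial_km, temporal_days, semantic_sim,
               w_space=0.3, w_time=0.3, w_sem=0.4,
               space_scale=500.0, time_scale=30.0):
    """Combine spatial, temporal, and semantic criteria into one
    ranking score (higher means more similar)."""
    space_term = math.exp(-spatial_km / space_scale)   # decays with distance
    time_term = math.exp(-temporal_days / time_scale)  # decays with time gap
    return w_space * space_term + w_time * time_term + w_sem * semantic_sim

# A nearby, recent event outranks a distant, older one of equal
# semantic similarity:
near = gt_r_score(spatial_km=50, temporal_days=5, semantic_sim=0.9)
far = gt_r_score(spatial_km=2000, temporal_days=200, semantic_sim=0.9)
assert near > far
```

In a full pipeline, candidates would first be fetched by a dense retriever and then re-ordered by such a score.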


The impact of spatio-temporal travel distance on epidemics using an interpretable attention-based sequence-to-sequence model

Jiang, Yukang, Tian, Ting, Xie, Huajun, Guo, Hailiang, Wang, Xueqin

arXiv.org Artificial Intelligence

Amidst the COVID-19 pandemic, travel restrictions have emerged as crucial interventions for mitigating the spread of the virus. In this study, we enhance the predictive capabilities of our model, Sequence-to-Sequence Epidemic Attention Network (S2SEA-Net), by incorporating an attention module, allowing us to assess the impact of distinct classes of travel distances on epidemic dynamics. Furthermore, our model provides forecasts for new confirmed cases and deaths. To achieve this, we leverage daily data on population movement across various travel distance categories, coupled with county-level epidemic data in the United States. Our findings illuminate a compelling relationship between the volume of travelers at different distance ranges and the trajectories of COVID-19. Notably, a discernible spatial pattern emerges with respect to these travel distance categories on a national scale. We unveil the geographical variations in the influence of population movement at different travel distances on the dynamics of epidemic spread. This will contribute to the formulation of strategies for future epidemic prevention and public health policies.
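The interpretability claim rests on reading attention weights over travel-distance categories. A minimal sketch of that readout (the softmax form and the category labels are illustrative assumptions, not the S2SEA-Net architecture):

```python
import numpy as np

def attention_weights(scores):
    """Softmax over per-category attention scores; the weights sum to 1
    and can be read as the relative influence of each travel-distance
    class on the prediction."""
    scores = np.asarray(scores, dtype=float)
    e = np.exp(scores - scores.max())  # subtract max for numerical stability
    return e / e.sum()

# Hypothetical scores for short / medium / long travel-distance classes:
weights = attention_weights([2.0, 0.5, -1.0])
```

Here the first (short-distance) class would be interpreted as the dominant driver for that county and time window.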


Active Retrieval Augmented Generation

Jiang, Zhengbao, Xu, Frank F., Gao, Luyu, Sun, Zhiqing, Liu, Qian, Dwivedi-Yu, Jane, Yang, Yiming, Callan, Jamie, Neubig, Graham

arXiv.org Artificial Intelligence

Despite the remarkable ability of large language models (LMs) to comprehend and generate language, they have a tendency to hallucinate and create factually inaccurate output. Augmenting LMs by retrieving information from external knowledge resources is one promising solution. Most existing retrieval augmented LMs employ a retrieve-and-generate setup that only retrieves information once based on the input. This is limiting, however, in more general scenarios involving generation of long texts, where continually gathering information throughout generation is essential. In this work, we provide a generalized view of active retrieval augmented generation, methods that actively decide when and what to retrieve across the course of the generation. We propose Forward-Looking Active REtrieval augmented generation (FLARE), a generic method which iteratively uses a prediction of the upcoming sentence to anticipate future content, which is then utilized as a query to retrieve relevant documents to regenerate the sentence if it contains low-confidence tokens. We test FLARE along with baselines comprehensively over 4 long-form knowledge-intensive generation tasks/datasets. FLARE achieves superior or competitive performance on all tasks, demonstrating the effectiveness of our method. Code and datasets are available at https://github.com/jzbjyb/FLARE.
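The iterative retrieve-and-regenerate loop described above can be sketched as follows; `generate_sentence`, `retrieve`, and `confidence` are hypothetical stand-ins for the LM, the retriever, and the token-confidence check, not FLARE's actual implementation:

```python
def flare_generate(question, generate_sentence, retrieve, confidence,
                   threshold=0.8, max_sentences=10):
    """Draft each sentence in turn; if the draft is low-confidence,
    use it as a retrieval query and regenerate with the fetched docs."""
    answer = []
    for _ in range(max_sentences):
        draft = generate_sentence(question, answer, docs=None)
        if draft is None:  # generation finished
            break
        if confidence(draft) < threshold:
            docs = retrieve(draft)  # the tentative sentence is the query
            draft = generate_sentence(question, answer, docs=docs)
        answer.append(draft)
    return " ".join(answer)
```

The key design point is that retrieval is triggered only when the model is uncertain, rather than once up front.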


Hyperbolic Image-Text Representations

Desai, Karan, Nickel, Maximilian, Rajpurohit, Tanmay, Johnson, Justin, Vedantam, Ramakrishna

arXiv.org Artificial Intelligence

Visual and linguistic concepts naturally organize themselves in a hierarchy, where a textual concept "dog" entails all images that contain dogs. Despite being intuitive, current large-scale vision and language models such as CLIP do not explicitly capture such hierarchy. We propose MERU, a contrastive model that yields hyperbolic representations of images and text. Hyperbolic spaces have suitable geometric properties to embed tree-like data, so MERU can better capture the underlying hierarchy in image-text datasets. Our results show that MERU learns a highly interpretable and structured representation space while being competitive with CLIP's performance on standard multi-modal tasks like image classification and image-text retrieval.
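The geometric intuition is that hyperbolic distances grow rapidly toward the boundary of the space, which is what lets it embed tree-like hierarchies with low distortion. A minimal sketch in the Poincaré ball model (an illustrative stand-in; MERU itself works in the Lorentz model):

```python
import numpy as np

def poincare_distance(u, v):
    """Geodesic distance between two points inside the unit Poincaré ball."""
    u, v = np.asarray(u, dtype=float), np.asarray(v, dtype=float)
    num = 2.0 * np.sum((u - v) ** 2)
    den = (1.0 - np.sum(u ** 2)) * (1.0 - np.sum(v ** 2))
    return float(np.arccosh(1.0 + num / den))
```

Generic concepts ("dog") can sit near the origin and specific instances near the boundary, so specializing a concept moves it outward along a geodesic.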


Bot or Human? Detecting ChatGPT Imposters with A Single Question

Wang, Hong, Luo, Xuan, Wang, Weizhi, Yan, Xifeng

arXiv.org Artificial Intelligence

Large language models like ChatGPT have recently demonstrated impressive capabilities in natural language understanding and generation, enabling various applications including translation, essay writing, and chit-chatting. However, there is a concern that they can be misused for malicious purposes, such as fraud or denial-of-service attacks. Therefore, it is crucial to develop methods for detecting whether the party involved in a conversation is a bot or a human. In this paper, we propose a framework named FLAIR, Finding Large language model Authenticity via a single Inquiry and Response, to detect conversational bots in an online manner. Specifically, we target a single question scenario that can effectively differentiate human users from bots. The questions are divided into two categories: those that are easy for humans but difficult for bots (e.g., counting, substitution, positioning, noise filtering, and ASCII art), and those that are easy for bots but difficult for humans (e.g., memorization and computation). Our approach shows different strengths of these questions in their effectiveness, providing a new way for online service providers to protect themselves against nefarious activities and ensure that they are serving real users. We open-sourced our dataset on https://github.com/hongwang600/FLAIR and welcome contributions from the community to enrich such detection datasets.
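One of the question categories above (counting) is easy to illustrate; the helpers below are a hypothetical sketch of the idea, not the FLAIR dataset's question generator:

```python
import random

def make_counting_question(rng=None):
    """Build a question asking how often a letter appears in a random
    string; returns the question text and the exact expected count.
    Easy for a human reading the string, historically error-prone
    for LLMs."""
    rng = rng or random.Random(0)
    s = "".join(rng.choice("abcdefghijklmnopqrstuvwxyz") for _ in range(20))
    target = rng.choice(s)
    return f'How many times does "{target}" appear in "{s}"?', s.count(target)

def check_answer(expected, reply):
    """The responder passes only if the count is exact."""
    try:
        return int(reply.strip()) == expected
    except ValueError:
        return False
```

A service would generate a fresh question per session so the answer cannot be memorized.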


Eelgrass wasting disease has new enemies: Drones and artificial intelligence

#artificialintelligence

"There are a number of seagrass monitoring programs that work on regional and to some degree on global scales, but most of them are really only looking at the cover and the abundance of the seagrass itself," said Emmett Duffy, director of the Marine Global Earth Observatories (MarineGEO) headquartered at the Smithsonian Environmental Research Center. The new grant builds on collaborative work by the Zostera Experimental Network (ZEN), led by Duffy, and will look at how climate, biodiversity and other environmental aspects can change the course of the disease. The team is deploying a wide arsenal of weapons to understand it: In addition to marine biologists, they are bringing on geographers, computer scientists, artificial intelligence and drones. Seagrasses are among the most valuable ecosystems on Earth. They provide habitat for popular fish like salmon and herring, protect shorelines from erosion and filter out nutrient pollution.